Word Extraction in Text/Graphic Mixed Image Using 3-Dimensional Graph Model

نویسندگان

  • Hwan-chul Park
  • Se-young Ok
  • Hwan-gue Cho
چکیده

Automatic Text location, character recognition and image understanding of a given paper document are main objectives of computer vision area. The rst stage for these problems is extracting text information and separating graphic symbol from texts. Previous text location algorithm could not extract the negative text(e.g., newspaper headline) which is a white colored text on some solid background color plane. Also they could extract only the horizontal or vertical text in a document, so the inclined text or text on a circular arc can not be located by the previous works. In this paper, we propose a new extracting method for these negative texts and real texts from a text/graphics mixed document image. Also we propose a new word grouping method when texts are intersected each other or placed on a circular arc or an inclined line segment with an arbitrary orientation. The basic strategy of our algorithm is based on the frequency analysis of the run-length encoded le of the image segment. Generally the number of runs in the run-length encoding for a text(character) is smaller than that of graphic symbol. And the average and variance of the number of runs in a run-length encoding gives a nice characterization of symbols and texts. After isolating each letter in a document le, we need to group the related letters a word. This procedure is a crucial work for an automatic document processing, since the unit of the nal output of the document processing should be a word or a statement. For this procedure, we propose 3 dimensional neighborhood graph for grouping words and statement from a set of isolated letters obtained from the rst letter isolating phase. This graph maps each letter in a document to a vertex in 3-dimensional space according to the size of that letter. Experimental results show that more than 97% of words were successfully extracted from the text/graphics mixed document including negative texts. This result shows the usefulness of our character isolating algorithm and our 3-dimensional graph mapping for the document with oriental characters.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

EXTRACTION-BASED TEXT SUMMARIZATION USING FUZZY ANALYSIS

Due to the explosive growth of the world-wide web, automatictext summarization has become an essential tool for web users. In this paperwe present a novel approach for creating text summaries. Using fuzzy logicand word-net, our model extracts the most relevant sentences from an originaldocument. The approach utilizes fuzzy measures and inference on theextracted textual information from the docu...

متن کامل

A Novel Method for Content Base Image Retrieval Using Combination of Local and Global Features

Content-based image retrieval (CBIR) has been an active research topic in the last decade. In this paper we proposed an image retrieval method using global and local features. Firstly, for local features extraction, SURF algorithm produces a set of interest points for each image and a set of 64-dimensional descriptors for each interest points and then to use Bag of Visual Words model, a cluster...

متن کامل

A Novel Method for Content Base Image Retrieval Using Combination of Local and Global Features

Content-based image retrieval (CBIR) has been an active research topic in the last decade. In this paper we proposed an image retrieval method using global and local features. Firstly, for local features extraction, SURF algorithm produces a set of interest points for each image and a set of 64-dimensional descriptors for each interest points and then to use Bag of Visual Words model, a cluster...

متن کامل

A Comparative Study of Segmentation in Mixed-Mode Images

The detection and extraction of text regions in an image is a well known problem in the computer vision research area. Text extraction is a critical and essential step as it sets up the quality of the final recognition result. It aims at segmenting text from background, i.e isolating text pixels from those of background. Since readymade mixed mode image data is not available, it is necessary to...

متن کامل

Directional Stroke Width Transform to Separate Text and Graphics in City Maps

One of the complex documents in the real world is city maps. In these kinds of maps, text labels overlap by graphics with having a variety of fonts and styles in different orientations. Usually, text and graphic colour is not predefined due to various map publishers. In most city maps, text and graphic lines form a single connected component. Moreover, the common regions of text and graphic lin...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2001